智能论文笔记

Context-self contrastive pretraining for crop type semantic segmentation

Michail Tarasiou , Riza Alp Guler , Stefanos Zafeiriou

分类：计算机视觉 | 机器学习

2021-04-09

在本文中，我们提出了一种基于对比学习的完全监督的预培训方案，特别针对密集的分类任务。所提出的上下文 - 自我对比损失（CSCL）了解嵌入空间，通过在训练样本中的每个位置与其本地上下文之间使用相似性度量来弹出语义边界。对于从卫星图像时间序列（坐）的作物类型语义分割我们在宗地边界中发现性能是一个关键的瓶颈，并解释CSCL如何解决该问题的潜在原因，从而提高本任务中的最先进的性能。此外，我们使用来自Sentinel-2（S2）卫星任务的图像，我们编写了我们的知识，坐在裁剪类型和包裹身份密集地注释的数据集，我们将与数据生成管道一起公开使用。使用我们发现CSCL的数据，即使具有最小的预训练，以改善所有相应的基线，并且在超级分辨率下提出语义分割的过程，以获得更粒度的茶几。下载数据的代码和说明可以在https://github.com/michaeltrs/deepsatmodels中找到。

translated by 谷歌翻译

Building Segmentation on Satellite Images and Performance of Post-Processing Methods

Metehan Yalçın , Ahmet Alp Kindiroglu , Furkan Burak Bağcı , Ufuk Uyan , Mahiye Uluyağmur Öztürk

分类：计算机视觉

2022-12-28

Researchers are doing intensive work on satellite images due to the information it contains with the development of computer vision algorithms and the ease of accessibility to satellite images. Building segmentation of satellite images can be used for many potential applications such as city, agricultural, and communication network planning. However, since no dataset exists for every region, the model trained in a region must gain generality. In this study, we trained several models in China and post-processing work was done on the best model selected among them. These models are evaluated in the Chicago region of the INRIA dataset. As can be seen from the results, although state-of-art results in this area have not been achieved, the results are promising. We aim to present our initial experimental results of a building segmentation from satellite images in this study.

translated by 谷歌翻译

Semi-Supervised Domain Adaptation for Semantic Segmentation of Roads from Satellite Images

Ahmet Alp Kindiroglu , Metehan Yalçın , Furkan Burak Bağcı , Mahiye Uluyağmur Öztürk

分类：计算机视觉

2022-12-26

This paper presents the preliminary findings of a semi-supervised segmentation method for extracting roads from sattelite images. Artificial Neural Networks and image segmentation methods are among the most successful methods for extracting road data from satellite images. However, these models require large amounts of training data from different regions to achieve high accuracy rates. In cases where this data needs to be of more quantity or quality, it is a standard method to train deep neural networks by transferring knowledge from annotated data obtained from different sources. This study proposes a method that performs path segmentation with semi-supervised learning methods. A semi-supervised field adaptation method based on pseudo-labeling and Minimum Class Confusion method has been proposed, and it has been observed to increase performance in targeted datasets.

translated by 谷歌翻译

A Hypervolume Based Approach to Rank Intuitionistic Fuzzy Sets and Its Extension to Multi-criteria Decision Making Under Uncertainty

Kaan Deveci , Onder Guler

分类：人工智能

2022-12-25

Ranking intuitionistic fuzzy sets with distance based ranking methods requires to calculate the distance between intuitionistic fuzzy set and a reference point which is known to have either maximum (positive ideal solution) or minimum (negative ideal solution) value. These group of approaches assume that as the distance of an intuitionistic fuzzy set to the reference point is decreases, the similarity of intuitionistic fuzzy set with that point increases. This is a misconception because an intuitionistic fuzzy set which has the shortest distance to positive ideal solution does not have to be the furthest from negative ideal solution for all circumstances when the distance function is nonlinear. This paper gives a mathematical proof of why this assumption is not valid for any of the non-linear distance functions and suggests a hypervolume based ranking approach as an alternative to distance based ranking. In addition, the suggested ranking approach is extended as a new multicriteria decision making method, HyperVolume based ASsessment (HVAS). HVAS is applied for multicriteria assessment of Turkey's energy alternatives. Results are compared with three well known distance based multicriteria decision making methods (TOPSIS, VIKOR, and CODAS).

translated by 谷歌翻译

Building Height Prediction with Instance Segmentation

Furkan Burak Bagci , Ahmet Alp Kindriroglu , Metehan Yalcin , Ufuk Uyan , Mahiye Uluyagmur Ozturk

分类：计算机视觉

2022-12-19

Extracting building heights from satellite images is an active research area used in many fields such as telecommunications, city planning, etc. Many studies utilize DSM (Digital Surface Models) generated with lidars or stereo images for this purpose. Predicting the height of the buildings using only RGB images is challenging due to the insufficient amount of data, low data quality, variations of building types, different angles of light and shadow, etc. In this study, we present an instance segmentation-based building height extraction method to predict building masks with their respective heights from a single RGB satellite image. We used satellite images with building height annotations of certain cities along with an open-source satellite dataset with the transfer learning approach. We reached, the bounding box mAP 59, the mask mAP 52.6, and the average accuracy value of 70% for buildings belonging to each height class in our test set.

translated by 谷歌翻译

Real Time Incremental Image Mosaicking Without Use of Any Camera Parameter

Suleyman Melih Portakal , Ahmet Alp Kindiroglu , Mahiye Uluyagmur Ozturk

分类：计算机视觉

2022-12-05

Over the past decade, there has been a significant increase in the use of Unmanned Aerial Vehicles (UAVs) to support a wide variety of missions, such as remote surveillance, vehicle tracking, and object detection. For problems involving processing of areas larger than a single image, the mosaicking of UAV imagery is a necessary step. Real-time image mosaicking is used for missions that requires fast response like search and rescue missions. It typically requires information from additional sensors, such as Global Position System (GPS) and Inertial Measurement Unit (IMU), to facilitate direct orientation, or 3D reconstruction approaches to recover the camera poses. This paper proposes a UAV-based system for real-time creation of incremental mosaics which does not require either direct or indirect camera parameters such as orientation information. Inspired by previous approaches, in the mosaicking process, feature extraction from images, matching of similar key points between images, finding homography matrix to warp and align images, and blending images to obtain mosaics better looking, plays important roles in the achievement of the high quality result. Edge detection is used in the blending step as a novel approach. Experimental results show that real-time incremental image mosaicking process can be completed satisfactorily and without need for any additional camera parameters.

translated by 谷歌翻译

Minimum Class Confusion based Transfer for Land Cover Segmentation in Rural and Urban Regions

Metehan Yalçın , Ahmet Alp Kındıroğlu , Furkan Burak Bağcı , Ufuk Uyan , Mahiye Uluyağmur Öztürk

分类：计算机视觉

2022-12-05

Transfer Learning methods are widely used in satellite image segmentation problems and improve performance upon classical supervised learning methods. In this study, we present a semantic segmentation method that allows us to make land cover maps by using transfer learning methods. We compare models trained in low-resolution images with insufficient data for the targeted region or zoom level. In order to boost performance on target data we experiment with models trained with unsupervised, semi-supervised and supervised transfer learning approaches, including satellite images from public datasets and other unlabeled sources. According to experimental results, transfer learning improves segmentation performance 3.4% MIoU (Mean Intersection over Union) in rural regions and 12.9% MIoU in urban regions. We observed that transfer learning is more effective when two datasets share a comparable zoom level and are labeled with identical rules; otherwise, semi-supervised learning is more effective by using the data as unlabeled. In addition, experiments showed that HRNet outperformed building segmentation approaches in multi-class segmentation.

translated by 谷歌翻译

Explainable Artificial Intelligence for Improved Modeling of Processes

Riza Velioglu , Jan Philip Göpfert , André Artelt , Barbara Hammer

分类：机器学习 | 人工智能

2022-12-01

In modern business processes, the amount of data collected has increased substantially in recent years. Because this data can potentially yield valuable insights, automated knowledge extraction based on process mining has been proposed, among other techniques, to provide users with intuitive access to the information contained therein. At present, the majority of technologies aim to reconstruct explicit business process models. These are directly interpretable but limited concerning the integration of diverse and real-valued information sources. On the other hand, Machine Learning (ML) benefits from the vast amount of data available and can deal with high-dimensional sources, yet it has rarely been applied to being used in processes. In this contribution, we evaluate the capability of modern Transformer architectures as well as more classical ML technologies of modeling process regularities, as can be quantitatively evaluated by their prediction capability. In addition, we demonstrate the capability of attentional properties and feature relevance determination by highlighting features that are crucial to the processes' predictive abilities. We demonstrate the efficacy of our approach using five benchmark datasets and show that the ML models are capable of predicting critical outcomes and that the attention mechanisms or XAI components offer new insights into the underlying processes.

translated by 谷歌翻译

Towards Human-Centred Explainability Benchmarks For Text Classification

Viktor Schlegel , Erick Mendez-Guzman , Riza Batista-Navarro

分类：自然语言处理

2022-11-10

Progress on many Natural Language Processing (NLP) tasks, such as text classification, is driven by objective, reproducible and scalable evaluation via publicly available benchmarks. However, these are not always representative of real-world scenarios where text classifiers are employed, such as sentiment analysis or misinformation detection. In this position paper, we put forward two points that aim to alleviate this problem. First, we propose to extend text classification benchmarks to evaluate the explainability of text classifiers. We review challenges associated with objectively evaluating the capabilities to produce valid explanations which leads us to the second main point: We propose to ground these benchmarks in human-centred applications, for example by using social media, gamification or to learn explainability metrics from human judgements.

translated by 谷歌翻译

Variational Bayes for robust radar single object tracking

Alp Sarı , Tak Kaneko , Lense H. M. Swaenen , Wouter M. Kouw

分类：计算机视觉 | 机器学习

2022-09-28

我们通过雷达来解决对象跟踪以及处理异常值的当前最新方法的鲁棒性。标准跟踪算法从雷达图像空间中提取检测到在过滤阶段使用它。过滤由卡尔曼过滤器进行，该滤波器假设高斯分布式噪声。但是，此假设并不能说明大型建模错误，并导致突然动作期间的跟踪性能差。我们将高斯总和过滤器（多假设跟踪器的单对象变体）作为基线，并通过与比高斯更重的分布建模工艺噪声来提出修改。变分贝叶斯提供了一种快速，计算上便宜的推理算法。我们的模拟表明，在存在过程离群值的情况下，稳健的跟踪器在跟踪单个对象时优于高斯总和过滤器。

translated by 谷歌翻译